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NOTE ON A CLASS OF POLYNOMIALS OF APPROXIMATION* 



DUNHAM JACKSON 

1. Introduction. In a recent paper f it was shown that if f(x) is a given 
continuous function in the interval a Ss x = b , n a given positive integer, and 
m a given real number greater than 1 , there exists one and just one polynomial 
<j) ( x ) , of degree n or lower, which reduces to a minimum the value of the 
integral 

f |/(3! ) -4>(x)\™dx. 
•/ a. 

The purpose of this note is to establish the truth of the same proposition in 
the case that m = 1 , for which the proof is considerably less simple. 

The function <f>(x) in the earlier paper was, more generally, a linear com- 
bination, with constant coefficients, of an arbitrarily given set of linearly 
independent continuous functions. The existence of at least one minimizing 
function is proved with the same degree of generality here. The argument 
involves no new difficulty, and in fact is slightly simpler than for m > 1 . For 
the proof of uniqueness, however, <j>(x) is essentially a polynomial as in- 
dicated, $ though a corresponding treatment could be given for other approxi- 
mating functions, such as finite trigonometric sums, which are similarly de- 
termined by their roots. 

2. Existence of an approximating function. Let p (x) ,pi(x), ■ ■ ■ , p n ( x ) 
be n + 1 functions of x , continuous throughout the interval a Si x Si b , and 
linearly independent in this interval. Let 

4> ( x ) = c pa ( x ) + ci pi ( x ) + • • • + c n p n (x) 

be an arbitrary linear combination of these functions with constant coefficients, 
and let II be the maximum of | </> ( x ) | in ( a , b ) . 

* Presented to the Society, September 8, 1920. 

t D. Jackson, On functions of closest approximation, these Transactions, vol. 22 
(1921), pp. 117-128. The main problem of the paper cited, going somewhat beyond the re- 
sult quoted here, had previously been treated, for the polynomial case, by Polya, in a note 
with which I was not acquainted at the time of writing: Sur un algorithme, etc., C o m p t e s 
Rendus, vol. 157 (1913), pp. 840-843. 

J The limitation is in the fact, not merely in the proof. For example, if / ( x ) = 1 , 
<t>(x) = ex, in the interval — 1 ^=x rgl, the integral of \f — <j>\ takes on its minimum 
value for any value of the coefficient c in the interval — 1 Si c :S 1 . Here <t> is still a poly- 
nomial, to be sure, but not an arbitrary polynomial of its degree. The coefficients for the 
best polynomial of the form c + Ci x are evidently uniquely determined, namely c = 1 , Ci = . 
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Then there exists a constant P , completely determined by the system of 
functions pa ( x ) , • • ■ , p n ( x ) , such that 

\c k \^PH (fc = 0,1, •••,«) 

for all functions <f>(x) . 

This lemma, due originally to Sibirani, is proved in the paper already cited,* 
and the proof need not be repeated here. 

Let 

16 



G = f \4>{x)\dx; 



then there exists a constant Q, completely determined by the system of 
f unctionsf p ( x ) , • ■ • , p n ( x ) , such that 

\c k \^QG (fc = 0,1, •-,«), 

for all functions <j)(x) . For J 

I Co I po(x)dx + d I pi(x)dx + ■ ■ ■ + c n I p n (x)dx\ 

*J a *Ja *J a 

r»x s*x r*h 

= | I <j> ( a; ) dx | Si I 1 ( a; ) | da; Si | |^>(a*)|rfa; = G, 

t^a «^a e/a 

and it suffices to apply the lemma previously stated to the set of linearly 
independent functions 



p ( x ) dx , ■ ■ • , I p n (x)dx, 

•'a 



the constant in this case being denoted by Q . 

Finally, let / ( x ) be a function continuous f or a 21 a; =1 6 , arbitrary at the 
outset, but to be kept unchanged throughout the remainder of the discussion; 
let M be the maximum of |/ ( x ) | in ( a , b ) ; let 



g = I I/O) - <M«)l<fo; 

J a 



and let A = b — a . Then, for all functions <f> ( x ) , 

M2iQ(<7 + MA), 



* Loc. cit., Lemma I; the notation has been changed somewhat. 

f It is understood that the definition of the functions includes a specification of the interval 
in which they are defined; when the p's are the successive powers of x , for example, the value 
of Q will depend on the interval in which they are considered. 

t Cf. loc. cit., proof of Lemma II. 
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where Q has the same value as before. For 

\cf>(x)\^\f(x) -4>(x)\ + \f(x)\, 

f \4>{x)\dx^g+ f \f(x)\dx^g + MA, 

and the preceding lemma is applicable. 

From this third lemma it follows that if a function 4> ( x ) is sought to make 
the value of g as small as possible, all the coefficients c , • • • , c n that come 
into consideration belong to a bounded region in the corresponding space of 
n + 1 dimensions, a region which may be regarded as closed. As g is a con- 
tinuous function of the coefficients, there will surely be at least one determina- 
tion of <f> for which g actually attains its lower limit y. Such a function 4> 
will be called for brevity an approximating function. 

3. Necessary condition for the existence of two approximating functions. 
Suppose there are two functions,* <j>o(x) and <f>i(x), linear combinations of 
the p's, for each of which g has the minimum value y . Let 



and let 
Then 

Hence 



<k(x) =-A<l>a{x) + <j)i(x)], 



n(x) =/(*) - <f>i(x) (i =0,1,2). 



n{x) =-[r (x) + n(x)]. 



\r2(x)\^-[\r Q (x)\ + \r 1 (x)\ ], 

the sign of equality holding, for any particular value of x , if r o n SO, while 
the relation is an inequality if r and n have opposite signs. If the inequality 
were to hold for any value of x , it would hold throughout an interval, because 
of the continuity of the functions, and it would follow that 

| T2 ( x ) | dx < y , 



X 



which is impossible. Consequently either of the remainder functions r (x) , 
ri ( x ) , must be positive or zero at every point where the other is positive, and 
negative or zero at every point where the other is negative. 

* In the earlier paper that has been cited, <f> m ( x ) denoted the function for which the inte- 
gral of the roth power of the absolute value of the error is a minimum. Here the exponent is 
restricted to the value 1, and the subscript is used without any such special significance. 
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Suppose there are two points X\ , x% , in the interval (a, b) , such that 

ro{xi) < 0, r (xi) > 0. 
Then 

n(a;i)^0, n(%)S0. 

There must be at least one point in the interval ( x\ , x?. ) at which r vanishes, 
and there must be at least one point where n vanishes. It is to be shown that 
there must be at least one point where both remainders vanish simultaneously. 
Let x[ be the upper limit of the values of a; in ( xi , x 2 ) for which r (x) < , 
and xi the lower limit of the values of x in ( x[ , x 2 ) for which r (x) > . 
Then r (x) is identically zero for x[ Si x Si x' 2 , is negative for points immedi- 
ately at the left of x[, and is positive for points immediately at the right of 
x' 2 . For definiteness, and to cover the less obvious case, it will be assumed that 
x{ < x'i . If x[ = Xi , so that the interval reduces to a point, the discussion 
is essentially the same, though a little simpler. In the interval x[ Si x Si xi , 
the function n ( x ) , being continuous, must either vanish somewhere, or be 
positive everywhere, or be negative everywhere. But if it were positive 
throughout the interval, it would be positive, in particular, at the point x[, 
and so, by continuity, would be positive at points immediately at the left of 
x[ , where r ( x ) is negative, and this is inadmissible. Similarly the hypothesis 
that n(x) < everywhere has to be rejected. It is certain, then, that 1\ (x) 
must vanish at some point of the interval in question. In every interval ivhere 
one of the remainders changes sign, there is at least one point where both remainders 
vanish simultaneously. At such a point, <£ and cf>i, being both equal to f, are 
equal to each other. 

To draw further consequences from the relation r riSO, let 

4>t{x) = 4> + t(4>i - 4>o), r t (x) =/(*) -4>t(x), OSiiSil. 

The function previously denoted by 2 ( x ) , for example, now has the repre- 
sentation <f>i(x). It will always be true that 

\r t (x)\ = (1 - t)\r (x)\ + t\n(x)\, 
and therefore 



r 



\r t (x)\dx = y. 



Everything that has been said about the pair of functions 4> , 4>i, can be said 
with equal force about the pair of functions 4>t corresponding to any two values 
of t in the interval ( , 1 ) . 

4. Uniqueness of the approximating polynomial. From now on the gener- 
ality of the discussion will be restricted, and it will be assumed that 

p k (x) = X* (fc =0,1, •••, n), 
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so that <j) ( x ) is a polynomial of the nth degree or lower. In other respects 
the previous notation will be retained, and it is to be shown that the hypothesis 
of the existence of two distinct polynomials of approximation, <f> and <fn , leads 
to a contradiction. 

There are two cases to be distinguished.* Suppose first that r (x) changes 
sign ?i+l times or more in (a,b) . By the italics of the preceding section 
there must then be at least n + 1 points where r and n vanish simultaneously, 
and so at least n + 1 points where 0o and 4>i are equal to each other. The 
identity of the two polynomials follows immediately. If there is any value of t 
in (0, 1) such that r t (x) changes sign n + 1 times or more, the same con- 
clusion can be drawn. 

Otherwise, suppose that r t (x) never changes sign more than n times. For 
any particular value of t, let r t (x) change sign just q times, q = n. It will 
be possible to select q points, x\,Xi, ■ ■ • , x g , so that r t (x) will be of constant 
sign (wherever different from zero) in each of the intervals a ts x Sk xi , 
Xi S x ^ xi , ■ ■ • , x q ^ x Si b , and will take on opposite signs in successive 
intervals. Let 

i> ( x ) = A ( x — xi ) ( a; — x 2 ) ■ ■ ■ ( x — x a ) , 

the constant multiplier A being chosen so that the maximum of | ^ ( a* ) | in 
( a , b ) is 1 , and so that \J/ ( x ) has the same sign as r t ( x ) at some point where 
rt(x) =f= 0. Then \p (x) will have the same sign as r t (x) at every point 
where r t (x) is different from zero. If r t does not change sign at all, it will 
be understood that if = ±1. 

Let e be any positive quantity, and let Ri be the set of points in ( a , b ) at 
which |r ( (a;)|=£, Si the complementary set, where |r 4 (.r)|<€. Then, 
in Ri, 

\r t (x) - el/(x)\ = \r t (x)\— e | \p ( x ) | , 

since r t and \p have the same sign, and \r t \ = e S e\\p\ . In Si, 

\r t (x) - e\p(x)\^\r t (x)\ +e|^(x)|. 



Hence 



f 

tJ a 



\r t (x) — ep{x) \dx 

= J + ^ I \r t (x)\dx — e | \\p(x)\dx+e J \ip (x)\dx , 



* If it were a question of the integral of the wth power of the absolute value of the error, 
m > 1 , the remainder for the approximating polynomial, if not identically zero, would neces- 
sarily change sign at least n + 1 times; cf. loc. cit., end of § 7, noticing that the polynomial 
there is of degree n — 1 at most. For m = 1 , however, it is readily seen that there are other 
possibilities. 
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so that 

I \r t {x) — ep {x)\dx < \ \r t {x)\dx 

unless 

J | \p ( x ) | dx = | \\f/(x)\dx. 

That is, the last relation must necessarily be satisfied, since otherwise the 
polynomial 

<t>t{x) + e\j/(x) 

would give a better approximation, in the sense of the integral of the absolute 
value of the error, than <j> t (x) , which is contrary to hypothesis. 

As e is arbitrarily small, it follows further that if R is the set of points where 
| r t ( x ) | > , and S the complementary set where r t ( x ) = , 

I | \p ( x ) | dx j£ I \ip{x)\dx, 
Js Jjt 

or, what amounts to the same thing, 

(1) f\+(x)\dx^±f h \f(x)\dx. 

In detail, the argument is as follows : Let ei , a , • • • , be a diminishing sequence 
of positive quantities approaching zero, and let Rj and Sj be the sets corre- 
sponding to tj, after the manner of the preceding paragraph. Then* R is 
the sum of all the sets Rj, each of which is included in the following, and S 
is the product of all the sets Sj, each of which is included in the preceding, 
and therefore 

lim mRj = mR , lim mSj = mS , 

j=K j=K 

the symbol m denoting measure of. As 

/-/+/ S-f-f. ' 

Jr *JRj <Jr-Rj 'Js JSj JSj-s 

and Iinij =tci m(R — Rj) = lim J=00 m(Sj — S) = , the desired relation follows 
immediately. 

From (1) it can be inferred that 

mS s h, 

where h is a positive quantity depending only on n , a , and b , and independent 

* It is convenient to use the language of the theory of Lebesgue integrals, though the 
problems of measure involved are very simple, since all the functions concerned are continuous, 
and each point-set that comes into consideration is made up of a finite number or an 
enumerable infinity of non-overlapping intervals and points. 
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of the coefficients in the particular polynomial \p • For, let 4'( x ) be an y 
polynomial of degree not higher than n , and let 

f \xP(x)\dx = G. 

*J a 

Then, for each coefficient c^ in \p , 

\c k \^QG, 

where Q is the constant of § 2. If W is the greatest value of any of 
the quantities 1 , [ x \ , ■ • • , \ x n [ in ( a, b ) , 

\xP(x)\m (n + l)QGW, 

and for any measurable set S in (a,b) , 

f\$(x)\dx^ (n + l)QGWmS. 

Js 

Hence (1) implies that* 

(n+ l)QGWmS ^\g , 

mS ~~ 



2(n + 1)QW 



and the last quantity may be taken as h . 

For every value of t in ( , 1 ) , then, it has been proved that f(x) must be 
identical with 4> t {x) throughout a point-set of measure at least h. But this 
is impossible. For the sets S corresponding to two different values of t can not 
have more than n points in common, since a common point is a point where 
the two polynomials 4>t(x) are equal, and hence the measure of the sum of 
the sets is the sum of their measures; and the necessary condition cannot be 
satisfied for more than N values of t , where N is the greatest integer contained 
in (b — a)/h. So the desired contradiction has been obtained, and the 
uniqueness of the approximating polynomial is assured. 

It may be emphasized once more that there is no assertion that the remainder 

for the polynomial of approximation must necessarily change sign n + 1 times ; 

it must either change sign n + 1 times, or vanish identically throughout a 

finite number or an enumerable infinity of intervals, having at least a certain 

specified aggregate length. 

The Univebsity of Minnesota, 
Minneapolis, Minn. 



* It may be assumed that g ^ 0, since there is no occasion for considering a function i 
that vanishes identically. 



